Poster: Alex.brollo Date: May 3, 2014 7:19am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

Thanks for your interest, Jeff; no, I have something different and much simpler in mind. I'm simply thinking of creating a "horizontal collection" named wikisource, grouping the IA books that are used as sources for wikisource. As you perhaps know, wikisource does something that can be seen as the "final derive step" for digitized books, i.e. it converts OCR text into a good (we hope) hypertext by fixing scannos and adding formatting and internal and external links. Simply adding a "collection:wikisource" tag to IA items that have been used, or are in use, by wikisource would bring several advantages:
1. it would be simple to get a list of such items;
2. their metadata could be verified, aligned and refined by the admins of the collection, using batch routines as well;
3. both projects could gain active users;
4. when wikisource produces a good HTML hypertext, which at present can be exported as PDF or ePub, a copy of that final result could be uploaded to the file list of the source IA item.

At present I'm uploading many books into the opallibriantichi collection (869 as of today); I'll certainly add a metadata entry or a link to any of them that is used in a proofreading procedure on wikisource. But this subgroup of opallibriantichi+wikisource items will be only a small fraction of the IA items used by wikisource projects, which are spread across a variety of different collections and managed by a variety of admins.

There is also a recent, bold project, wikidata, which collects and organizes metadata about anything in the wiki projects, and a "wikidata book task force" is working on book metadata; a good data alignment between wikidata and IA would also be very interesting and useful.

Poster: Jeff Kaplan Date: May 3, 2014 9:14am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

still a bit unclear: are you suggesting modifying files in existing items, or copying files, modifying them and making them available separately?

also, we would not add files to existing items unless they are from the original uploader. any modified or new files would need to be uploaded as separate, new items.

you can do this with any publicly available items. and, you could simply add a subject tag unique to your project that would make finding them through search simple.

Poster: Alex.brollo Date: May 3, 2014 3:16pm
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

Just as a test, I edited the metadata of the item RimeFrancescoPetrarca, adding a subject:wikisource entry and a link to it.wikisource in the notes field.

As you said, this is sufficient to retrieve the items used by wikisource and to get a backlink to the wikisource version; it could be a first step.
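
For anyone who wants to script that retrieval, here is a minimal sketch that only builds the search URL for the tag. It assumes the tag is searchable as subject:wikisource and that the public advancedsearch.php endpoint is used; the helper name build_search_url and the chosen fields and row count are just illustrative, not something agreed in this thread.

```python
from urllib.parse import urlencode

# Hypothetical helper: builds an archive.org advanced-search URL.
# It only constructs the URL; fetching it is left to the caller.
def build_search_url(query, fields=("identifier", "title"), rows=50):
    params = [("q", query)]
    # Each requested field is passed as a repeated fl[] parameter.
    params += [("fl[]", field) for field in fields]
    params += [("rows", str(rows)), ("output", "json")]
    return "https://archive.org/advancedsearch.php?" + urlencode(params)

# Assumption: the project tag is stored in the subject field as "wikisource".
url = build_search_url("subject:wikisource")
print(url)
```

Opening the printed URL in a browser should return a JSON list of the matching items, provided the tag has actually been added to them.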

Poster: Nemo_bis Date: Jun 24, 2014 7:18am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

We don't plan to change or reupload any file, only to have easier (read) access to the IA items used by Wikisource. For instance, "list Persian language books used on Wikisource", in order to run an OCR on them (which ABBYY doesn't do): trivial with internal search, a bit harder otherwise. (Most Wikisource users are not very technical.)

Even a "Wikisource" keyword, or any other searchable metadata field, would be wonderful as well.
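
To show what the "list Persian language books used on Wikisource" request amounts to once such a keyword exists, here is a rough sketch that filters metadata records locally. Everything in it is an assumption for illustration: the sample records and identifiers are made up, the set of Persian language values is a guess at the variants one might meet, and subject/language are treated as either a single string or a list.

```python
# Assumed spellings of "Persian" in the language field; adjust as needed.
PERSIAN_CODES = {"persian", "per", "fas", "fa"}

def as_list(value):
    """Normalize a metadata value that may be missing, a string, or a list."""
    if value is None:
        return []
    if isinstance(value, str):
        return [value]
    return list(value)

def is_wikisource_persian(record):
    """True if the record carries the wikisource keyword and a Persian language value."""
    subjects = {s.strip().lower() for s in as_list(record.get("subject"))}
    languages = {l.strip().lower() for l in as_list(record.get("language"))}
    return "wikisource" in subjects and bool(languages & PERSIAN_CODES)

# Made-up records, standing in for metadata fetched elsewhere.
records = [
    {"identifier": "example_item_a", "subject": ["wikisource", "poetry"], "language": "Persian"},
    {"identifier": "example_item_b", "subject": "wikisource", "language": "ita"},
    {"identifier": "example_item_c", "subject": "Wikisource", "language": ["per"]},
    {"identifier": "example_item_d", "subject": "poetry", "language": "per"},
]

matches = [r["identifier"] for r in records if is_wikisource_persian(r)]
print(matches)
```

With a searchable keyword in place, the same filter could of course be expressed directly as a search query instead of being done by hand.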